57 research outputs found

    Illuminating Choices for Library Prep: A Comparison of Library Preparation Methods for Whole Genome Sequencing of Cryptococcus neoformans Using Illumina HiSeq.

    Get PDF
    The industry of next-generation sequencing is constantly evolving, with novel library preparation methods and new sequencing machines being released by the major sequencing technology companies annually. The Illumina TruSeq v2 library preparation method was the most widely used kit and the market leader; however, it has now been discontinued, and in 2013 was replaced by the TruSeq Nano and TruSeq PCR-free methods, leaving a gap in knowledge regarding which is the most appropriate library preparation method to use. Here, we used isolates from the pathogenic fungi Cryptococcus neoformans var. grubii and sequenced them using the existing TruSeq DNA v2 kit (Illumina), along with two new kits: the TruSeq Nano DNA kit (Illumina) and the NEBNext Ultra DNA kit (New England Biolabs) to provide a comparison. Compared to the original TruSeq DNA v2 kit, both newer kits gave equivalent or better sequencing data, with increased coverage. When comparing the two newer kits, we found little difference in cost and workflow, with the NEBNext Ultra both slightly cheaper and faster than the TruSeq Nano. However, the quality of data generated using the TruSeq Nano DNA kit was superior due to higher coverage at regions of low GC content, and more SNPs identified. Researchers should therefore evaluate their resources and the type of application (and hence data quality) being considered when ultimately deciding on which library prep method to use

    On the effective depth of viral sequence data.

    Get PDF
    Genome sequence data are of great value in describing evolutionary processes in viral populations. However, in such studies, the extent to which data accurately describes the viral population is a matter of importance. Multiple factors may influence the accuracy of a dataset, including the quantity and nature of the sample collected, and the subsequent steps in viral processing. To investigate this phenomenon, we sequenced replica datasets spanning a range of viruses, and in which the point at which samples were split was different in each case, from a dataset in which independent samples were collected from a single patient to another in which all processing steps up to sequencing were applied to a single sample before splitting the sample and sequencing each replicate. We conclude that neither a high read depth nor a high template number in a sample guarantee the precision of a dataset. Measures of consistency calculated from within a single biological sample may also be insufficient; distortion of the composition of a population by the experimental procedure or genuine within-host diversity between samples may each affect the results. Where it is possible, data from replicate samples should be collected to validate the consistency of short-read sequence data

    Genomic analysis of extended-spectrum beta-lactamase (ESBL) producing Escherichia coli colonising adults in Blantyre, Malawi reveals previously undescribed diversity

    Get PDF
    Escherichia coli is one of the most prevalent Gram-negative species associated with drug resistant infections. Strains that produce extended-spectrum beta-lactamases (ESBLs) or carbapenemases are both particularly problematic and disproportionately impact resource limited healthcare settings where last-line antimicrobials may not be available. A large number of E. coli genomes are now available and have allowed insights into pathogenesis and epidemiology of ESBL E. coli but genomes from sub-Saharan Africa (sSA) are significantly underrepresented. To reduce this gap, we investigated ESBL-producing E. coli colonising adults in Blantyre, Malawi to assess bacterial diversity and AMR determinants and to place these isolates in the context of the wider population structure. We performed short-read whole-genome sequencing of 473 colonising ESBL E. coli isolated from human stool and contextualised the genomes with a previously curated multi-country collection of 10 146 E. coli genomes and sequence type (ST)-specific collections for our three most commonly identified STs. These were the globally successful ST131, ST410 and ST167, and the dominant ESBL genes were blaCTX-M, mirroring global trends. However, 37 % of Malawian isolates did not cluster with any isolates in the curated multicountry collection and phylogenies were consistent with locally spreading monophyletic clades, including within the globally distributed, carbapenemase-associated B4/H24RxC ST410 lineage. A single ST2083 isolate in this collection harboured a carbapenemase gene. Long read sequencing demonstrated the presence of a globally distributed ST410-associated carbapenemase carrying plasmid in this isolate, which was absent from the ST410 strains in our collection. We conclude there is a risk that carbapenem resistance in E. coli could proliferate rapidly in Malawi under increasing selection pressure, and that both ongoing antimicrobial stewardship and genomic surveillance are critical as local carbapenem use increases

    Yaws re-emergence and bacterial drug resistance selection after mass administration of azithromycin: a genomic epidemiology investigation.

    Get PDF
    BACKGROUND: In a longitudinal study assessing the WHO strategy for yaws eradication using mass azithromycin treatment, we observed resurgence of yaws cases with dominance of a single JG8 sequence type and emergence of azithromycin-resistant Treponema pallidum subspecies pertenue (T p pertenue). Here, we analyse genomic changes in the bacterial population using samples collected during the study. METHODS: We did whole bacterial genome sequencing directly on DNA extracted from 37 skin lesion swabs collected from patients on Lihir Island, Papua New Guinea, between April 1, 2013, and Nov 1, 2016. We produced phylogenies and correlated these with spatiotemporal information to investigate the source of new cases and the emergence of five macrolide-resistant cases. We used deep amplicon sequencing of surveillance samples to assess the presence of minority macrolide-resistant populations. FINDINGS: We recovered 20 whole T p pertenue genomes, and phylogenetic analysis showed that the re-emerging JG8 sequence type was composed of three bacterial sublineages characterised by distinct spatiotemporal patterns. Of five patients with resistant T p pertenue, all epidemiologically linked, we recovered genomes from three and found no variants. Deep sequencing showed that before treatment, the index patient had fixed macrolide-sensitive T p pertenue, whereas the post-treatment sample had a fixed resistant genotype, as did three of four contact cases. INTERPRETATION: In this study, re-emergence of yaws cases was polyphyletic, indicating multiple epidemiological sources. However, given the genomic and epidemiological linkage of resistant cases and the rarity of resistance alleles in the general population, azithromycin resistance is likely to have evolved only once in this study, followed by onward dissemination. FUNDING: Wellcome and Provincial Deputation of Barcelona

    Genomic epidemiology of syphilis in England: a population-based study.

    Get PDF
    BACKGROUND: Syphilis is a sexually transmitted bacterial infection caused by Treponema pallidum subspecies pallidum. Since 2012, syphilis rates have risen dramatically in many high-income countries, including England. Although this increase in syphilis prevalence is known to be associated with high-risk sexual activity in gay, bisexual, and other men who have sex with men (GBMSM), cases are rising in heterosexual men and women. The transmission dynamics within and between sexual networks of GBMSM and heterosexual people are not well understood. We aimed to investigate if whole genome sequencing could be used to supplement or enhance epidemiological insights around syphilis transmission. METHODS: We linked national patient demographic, geospatial, and behavioural metadata to whole T pallidum genome sequences previously generated from patient samples collected from across England between Jan 1, 2012, and Oct 31, 2018, and performed detailed phylogenomic analyses. FINDINGS: Of 497 English samples submitted for sequencing, we recovered 240 genomes (198 from the UK Health Security Agency reference laboratory and 42 from other laboratories). Three duplicate samples (same patient and collection date) were included in the main phylogenies, but removed from further analyses of English populations, leaving 237 genomes. 220 (92·8%) of 237 samples were from men, nine (3·8%) were from women, and eight (3·4%) were of unknown gender. Samples were mostly from London (n=118 [49·8%]), followed by southeast England (n=29 [12·2%]), northeast England (n=24 [10·1%]), and southwest England (n=15 [6·3%]). 180 (76·0%) of 237 genomes came from GBMSM, compared with 25 (10·5%) from those identifying as men who have sex with women, 15 (6·3%) from men with unrecorded sexual orientation, nine (3·8%) from those identifying as women who have sex with men, and eight (3·4%) from people of unknown gender and sexual orientation. Phylogenomic analysis and clustering revealed two dominant T pallidum sublineages in England. Sublineage 1 was found throughout England and across all patient groups, whereas sublineage 14 occurred predominantly in GBMSM older than 34 years and was absent from samples sequenced from the north of England. These different spatiotemporal trends, linked to demography or behaviour in the dominant sublineages, suggest they represent different sexual networks. By focusing on different regions of England we were able to distinguish a local heterosexual transmission cluster from a background of transmission in GBMSM. INTERPRETATION: These findings show that, despite extremely close genetic relationships between T pallidum genomes globally, genomics can still be used to identify putative transmission clusters for epidemiological follow-up. This could be of value for deconvoluting putative outbreaks and for informing public health interventions. FUNDING: Wellcome funding to the Sanger Institute, UK Research and Innovation, National Institute for Health and Care Research, European and Developing Countries Clinical Trials Partnership, and UK Health Security Agency

    Global phylogeny of Treponema pallidum lineages reveals recent expansion and spread of contemporary syphilis.

    Get PDF
    Funder: Queensland GovernmentSyphilis, which is caused by the sexually transmitted bacterium Treponema pallidum subsp. pallidum, has an estimated 6.3 million cases worldwide per annum. In the past ten years, the incidence of syphilis has increased by more than 150% in some high-income countries, but the evolution and epidemiology of the epidemic are poorly understood. To characterize the global population structure of T. pallidum, we assembled a geographically and temporally diverse collection of 726 genomes from 626 clinical and 100 laboratory samples collected in 23 countries. We applied phylogenetic analyses and clustering, and found that the global syphilis population comprises just two deeply branching lineages, Nichols and SS14. Both lineages are currently circulating in 12 of the 23 countries sampled. We subdivided T. p. pallidum into 17 distinct sublineages to provide further phylodynamic resolution. Importantly, two Nichols sublineages have expanded clonally across 9 countries contemporaneously with SS14. Moreover, pairwise genome analyses revealed examples of isolates collected within the last 20 years from 14 different countries that had genetically identical core genomes, which might indicate frequent exchange through international transmission. It is striking that most samples collected before 1983 are phylogenetically distinct from more recently isolated sublineages. Using Bayesian temporal analysis, we detected a population bottleneck occurring during the late 1990s, followed by rapid population expansion in the 2000s that was driven by the dominant T. pallidum sublineages circulating today. This expansion may be linked to changing epidemiology, immune evasion or fitness under antimicrobial selection pressure, since many of the contemporary syphilis lineages we have characterized are resistant to macrolides

    Genotypic Diversity Is Associated with Clinical Outcome and Phenotype in Cryptococcal Meningitis across Southern Africa.

    Get PDF
    Cryptococcal meningitis is a major cause of mortality throughout the developing world, yet little is known about the genetic markers underlying Cryptococcal virulence and patient outcome. We studied a cohort of 230 Cryptococcus neoformans (Cn) isolates from HIV-positive South African clinical trial patients with detailed clinical follow-up using multi-locus sequence typing and in vitro phenotypic virulence assays, correlating these data with clinical and fungal markers of disease in the patient. South African Cn displayed high levels of genetic diversity and locus variability compared to globally distributed types, and we identified 50 sequence types grouped within the main molecular types VNI, VNII and VNB, with 72% of isolates typed into one of seven 'high frequency' sequence types. Spatial analysis of patients' cryptococcal genotype was not shown to be clustered geographically, which might argue against recent local acquisition and in favour of reactivation of latent infection. Through comparison of MLST genotyping data with clinical parameters, we found a relationship between genetic lineage and clinical outcome, with patients infected with the VNB lineage having significantly worse survival (n=8, HR 3.35, CI 1.51-7.20, p=0.003), and this was maintained even after adjustment for known prognostic indicators and treatment regimen. Comparison of fungal genotype with in vitro phenotype (phagocytosis, laccase activity and CSF survival) performed on a subset of 89 isolates revealed evidence of lineage-associated virulence phenotype, with the VNII lineage displaying increased laccase activity (p=0.001) and ex vivo CSF survival (p=0.0001). These findings show that Cryptococcus neoformans is a phenotypically heterogeneous pathogen, and that lineage plays an important role in cryptococcal virulence during human infection. Furthermore, a detailed understanding of the genetic diversity in Southern Africa will support further investigation into how genetic diversity is structured across African environments, allowing assessment of the risks different ecotypes pose to infection

    AIDS-related mycoses: the way forward.

    Get PDF
    The contribution of fungal infections to the morbidity and mortality of HIV-infected individuals is largely unrecognized. A recent meeting highlighted several priorities that need to be urgently addressed, including improved epidemiological surveillance, increased availability of existing diagnostics and drugs, more training in the field of medical mycology, and better funding for research and provision of treatment, particularly in developing countries

    A genomic appraisal of invasive Salmonella Typhimurium and associated antibiotic resistance in sub-Saharan Africa

    Get PDF
    Invasive non-typhoidal Salmonella (iNTS) disease manifesting as bloodstream infection with high mortality is responsible for a huge public health burden in sub-Saharan Africa. Salmonella enterica serovar Typhimurium (S. Typhimurium) is the main cause of iNTS disease in Africa. By analysing whole genome sequence data from 1303 S. Typhimurium isolates originating from 19 African countries and isolated between 1979 and 2017, here we show a thorough scaled appraisal of the population structure of iNTS disease caused by S. Typhimurium across many of Africa’s most impacted countries. At least six invasive S. Typhimurium clades have already emerged, with ST313 lineage 2 or ST313-L2 driving the current pandemic. ST313-L2 likely emerged in the Democratic Republic of Congo around 1980 and further spread in the mid 1990s. We observed plasmid-borne as well as chromosomally encoded fluoroquinolone resistance underlying emergences of extensive-drug and pan-drug resistance. Our work provides an overview of the evolution of invasive S. Typhimurium disease, and can be exploited to target control measures
    corecore